Learning multi-objective robot control policies from demonstration

نویسندگان

  • Daniel H Grollman
  • Odest Chadwicke Jenkins
چکیده

When demonstrating unknown robot tasks via teleoperation, a human user may leverage information, latent in their mind, that is not observable to the robot. Such information may include user preferences as to how a task should be performed, state information observable to the human but not the robot, or task structure information such as subtask objectives. Multiple, different actions may thus occur in what the robot perceives to be the same state. The observed mapping from perceived states to actions, π : ŝ → a, may then be a one-to-many multimap, instead of a one-to-one or many-to-one function. Our interactive learning from demonstration architecture [1] enables robot learning from teleoperative demonstration via direct policy approximation (regression). Functional regression algorithms, however, are not appropriate for learning multimap policies (Fig. 1). Instead, we have developed ROGER (Realtime Overlapping Gaussian Expert Regression), a multimap regression algorithm for interactive learning from demonstration (Fig 2). ROGER is based on the Infinite Mixture of Gaussian Processes model [2]. We achieve interactivity with a human user by reformulating it incrementally as a particle filter and using the Sparse Online Gaussian Process formulation [3]. Current work focuses on improving the algorithm’s sparse and realtime properties and applying it to real robot tasks. Whilst learning a multimap in this manner, the overall task is decomposed into a collection of overlapping, functional, experts, each corresponding to a subtask. Properly selecting an expert or subtask at run time is analogous to 0 0.2 0.4 0.6 0.8 1 −1 −0.5 0 0.5 1

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Confidence-Based Multi-Robot Learning from Demonstration

Learning from demonstration algorithms enable a robot to learn a new policy based on demonstrations provided by a teacher. In this article, we explore a novel research direction, multi-robot learning from demonstration, which extends demonstration based learning methods to collaborative multi-robot domains. Specifically, we study the problem of enabling a single person to teach individual polic...

متن کامل

Learning in behavior-based multi-robot systems: policies, models, and other agents

This paper describes how the use of behaviors as the underlying control representation provides a useful encoding that both lends robustness to control and allows abstraction for handling scaling in learning, focusing on multi-agent / robot systems. We first define situatedness and embodiment, two key concepts in behavior-based systems (BBS), and then define BBS in detail and contrast it with a...

متن کامل

Teacher feedback to scaffold and refine demonstrated motion primitives on a mobile robot

Task demonstration is an effective technique for developing robot motion control policies. As tasks becomemore complex, however, demonstration can becomemore difficult. In this work, we introduce an algorithm that uses corrective human feedback to build a policy able to performanovel task, by combining simpler policies learned from demonstration. While some demonstration-based learning approach...

متن کامل

Workspace Boundary Avoidance in Robot Teaching by Demonstration Using Fuzzy Impedance Control

The present paper investigates an intuitive way of robot path planning, called robot teaching by demonstration. In this method, an operator holds the robot end-effector and moves it through a number of positions and orientations in order to teach it a desired task. The presented control architecture applies impedance control in such a way that the end-effector follows the operator’s hand with d...

متن کامل

Teaching Multi-Robot Coordination using Demonstration of Communication and State Sharing (Short Paper)

Solutions to complex tasks often require the cooperation of multiple robots, however, developing multi-robot policies can present many challenges. In this work, we introduce teaching by demonstration in the context of multi-robot tasks, enabling a single teacher to instruct multiple robots to work together through a demonstration of the desired behavior. Within this framework, we contribute two...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008